KiaDev Intelligence

#synthetic data augmentation04/07/2025

Crome: Google DeepMind's Causal Framework Enhances Reward Modeling for Safer LLM Alignment

Google DeepMind and collaborators introduce Crome, a causal framework that improves reward modeling robustness in LLM alignment by using counterfactual data augmentation to tackle reward hacking issues.

READ →